ABSTRACT:

A system monitors the performance of a communications network and isolates the location of 
equipment failures therein through analysis of performance error data. The system passively 
monitors the performance of the network, operating in the background during normal data and/or 
voice transmission. The head end generates and inserts multiple error codes, one for each 
remote unit, into the downstream traffic path. Each remote unit extracts and decodes its 
respective error code to determine whether a downstream error occurred. Each remote unit then 
calculates a new error code based on the result of the downstream decode and the respective 
upstream data message. The head end receives the upstream error codes from all the remote 
units, decodes them, and accumulates the error data from all remote units over a period of time. 
An error distribution is generated, and an analysis is performed thereon to isolate the location of 
error causing equipment failures. The information thus obtained can then be used to initiate 
facility and equipment protection and/or appropriate maintenance procedures. The invention 
results in an increased quality of transmission with minimal network down time and minimal 
impact on overall system performance. 
PERFORMANCE MONITORING AND FAILURE ISOLATION IN A
POINT-TO-MULTIPOINT COMMUNICATION NETWORK

Field of the Invention 

This invention relates generally to the field of
communications systems, and more particularly to a
system for monitoring the performance of a passive
distribution network connected in a point-to-multipoint
configuration and for isolating the location of
equipment failures therein.

Background of the Invention 
Deployment of passive distribution systems, such as
optical fiber, in the local telecommunications loop has
opened opportunities for new types of services. Most of
the new services target data transmission instead of
voice. The major difference in the network requirements
for digital data transmission versus voice transmission
is in the quality of the transmission. The demand for
high quality transmission and low outage time is forcing
the new generation of systems to protect equipment as
well as the passive distribution facilities.

The term "performance monitoring" is related to the
quality of transmission over a network. One measure of
a network's performance or transmission quality is the
bit error rate (BER). Because of imperfections in the
network and environmental conditions, some data errors
inevitably occur. However, equipment failures such as
breakages, power loss, etc., can also cause data
transmission errors to be introduced. BER is a measure
of the number of errors which occur in a certain number
of bits of transmission. For example, in a typical
network the maximum acceptable BER is 10^-10. This value
means that one transmission error is allowed every 10^10
bits. If the BER is greater than 10^-10, the quality of
transmission is not acceptable.
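
The BER arithmetic above can be illustrated with a minimal Python
sketch. The function names and the default limit are assumptions
chosen for illustration; only the 10^-10 example value comes from
the text.

```python
MAX_ACCEPTABLE_BER = 1e-10  # example limit quoted in the text


def bit_error_rate(error_count: int, bits_transmitted: int) -> float:
    """Return the number of errors per transmitted bit."""
    if bits_transmitted <= 0:
        raise ValueError("bits_transmitted must be positive")
    return error_count / bits_transmitted


def is_acceptable(error_count: int, bits_transmitted: int,
                  limit: float = MAX_ACCEPTABLE_BER) -> bool:
    """True when the measured BER does not exceed the limit."""
    return bit_error_rate(error_count, bits_transmitted) <= limit
```

With the example limit, one error in 2 x 10^10 bits is acceptable,
while three errors in 10^10 bits are not.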

In order to determine the BER, means must be provided
to detect errors that occur. Another desirable feature
is a mechanism to pinpoint the location of the
equipment failure which caused the errors so that
equipment and facility protection switching can be
achieved, or appropriate maintenance procedures
performed.

A point-to-multipoint passive distribution network
configuration consists of a head end connected to a
single data path which splits into multiple branches,
each branch associated with a unique remote unit.
Communication between the head end and the multiple
remote units is multiplexed on a passive distribution
network (PDN) and each remote unit is programmed to
extract and send data in a unique time slot. This means
that all remote units share the single, or "feeder"
section of the network, and that each has a dedicated
branch, or "distribution" section of the network
associated with it. One exemplary passive optical
system is described in U.S. Patent Number 4,977,593, to
Ballance, issued December 11, 1990 and assigned to
British Telecommunications, which is incorporated herein
by reference.

Performance monitoring of such a configuration
represents a major challenge because the indication of
an error in the error code does not itself identify the
particular equipment which is causing the errors.
Because the multiple remote units share the feeder
section of the PDN, in existing systems it is not
possible to identify whether an error was introduced in
the shared feeder, in a dedicated distribution branch or
in one of the remote units themselves.

Typically, the location of an equipment failure is
determined by downing all or part of the system and
performing interactive diagnostic tests between the head
end and the multiple remote units. This method,
however, results in an undesirable and severe
degradation in performance of the system as a whole.
There is therefore a need in the art for a means of
monitoring the performance of a PDN connected in a
point-to-multipoint configuration, which can isolate the
location of equipment failures with minimal impact on
overall system performance.

Summary of the Invention

To achieve the goals described above, the present
invention provides a system which monitors the
performance and isolates the location of equipment
failures in a PDN network arranged in a point-to-multipoint
configuration. The system passively monitors
the performance of the network, operating in the
background during normal data and/or voice transmission.
The system unintrusively monitors the network for errors
and isolates whether an equipment failure responsible
for generating the errors is located in the shared
feeder section of the PDN, in one of the dedicated
distribution sections of the PDN or in one of the remote
units themselves.

To isolate the location of equipment failures in the
network, the present invention recognizes that different
equipment failures in a point-to-multipoint
configuration result in unique error distributions as
seen across all remote units over time. The invention
uses these error distributions to isolate the location
of error causing equipment failures in the network.

To generate the error distributions, the head end
generates and inserts multiple downstream error codes,
one for each remote unit, into the downstream traffic
path. The remote units extract and decode their
respective error codes to determine whether a downstream
error occurred. Each remote unit then sets an internal
error flag indicating the result of the downstream
decode. Upstream error codes are generated by
each remote unit from the respective upstream data
message. The upstream error codes are then set to
indicate an error if a downstream error was received.
The head end receives the upstream error codes from all
the remote units, decodes them, and accumulates the
error data from all remote units over a predetermined
period of time. An error distribution representing the
distribution of errors occurring across the entire
system is generated from the accumulated error data.
The error distribution is analyzed via software data
processing methods to identify the type of error
distribution occurring and to isolate therefrom the
location of equipment failures which introduce the
errors into the system. The location is isolated to
either the shared feeder section of the PDN, to one of
the distribution sections of the PDN or to one of the
remote units. The information thus obtained can then be
used to initiate facility and equipment protection
and/or maintenance procedures. The system therefore
results in an increased quality of transmission with
minimal system down time and minimal impact on overall
system performance.

Brief Description of the Drawings

In the drawings, where like numerals refer to
like elements throughout the several views:

Figure 1 shows a representative
telecommunications system in block diagram form;

Figure 2 shows a simplified block diagram of a
representative telecommunications system, showing the
locations of the feeder and distribution sections of the
PDN network;

Figure 3 shows a block diagram of the relevant
portions of HDT 300;

Figure 4 shows a block diagram of the relevant
portions of RU 600;

Figure 5 shows a first type of error
distribution which can occur in a point-to-multipoint
system configuration;

Figure 6 shows a second type of error
distribution;



WO 94/29983 



PCT/US93/11048 




Figure 7 shows a third type of error
distribution;

Figure 8 shows a fourth type of error
distribution;

Figure 9 shows a flow diagram of the error data
processing methods used to isolate the location of
equipment failures; and

Figure 10 shows a flow diagram of the polling
scheme used to determine whether an error was caused by
a failure in the upstream or downstream data traffic
path.

Detailed Description of the Preferred Embodiment 
In the following detailed description of the
preferred embodiment, reference is made to the
accompanying drawings which form a part hereof, and in
which is shown by way of illustration a specific
embodiment in which the invention may be practiced.
This embodiment is described in sufficient detail to
enable one skilled in the art to make and use the
invention. It will be understood that other embodiments
may be utilized and that structural changes may be made
without departing from the spirit and scope of the
present invention. The following detailed description
is therefore not to be taken in a limiting sense, and
the scope of the invention is to be defined by the
appended claims.

A general description of a telecommunications system
will now be given. Figure 1 shows a representative
telecommunications system 100 in block diagram form. It
shall be understood that other configurations are
possible without departing from the scope of the present
invention. The system includes a head end, or Host
Digital Terminal (HDT) 300, which acts as an interface
between a Local Digital Switch (LDS) 200 and multiple
Passive Distribution Networks (PDN's) 500. In the
preferred embodiment, HDT 300 receives signals from and
transmits signals to LDS 200 using the well-known
transmission format described in Bellcore document
TR-TSY-000008, known as the TR-8 transmission format.
The primary signal carried over this format is a DS1
signal, and the transmission system is the T1 digital
transmission system. A DS1 signal consists of 24 groups
of multiplexed eight-bit samples (DS0's) and one framing
bit. Each eight-bit sample or DS0 represents an
individual channel (a telephone conversation, for
example). It shall be understood, however, that the
performance monitoring scheme of the present invention
could also be used in systems utilizing other interface
formats such as OC-X, STS-X, etc., or a multitude of
other transmission formats, without departing from the
scope of the present invention.

Each HDT 300 includes termination points for the DS1
lines coming into the HDT 300, circuitry for converting
from TR-8 format to PDN format, and circuitry which
routes the incoming channels to the appropriate remote
unit. The routed channels are then multiplexed to form
the downstream traffic signals which are sent over PDN's
500 to their respective destinations.

The downstream optical signals are transmitted over
PDN's 500, which in the case of a passive optical
network consist of a network of optical fiber and
passive optical splitters, and which terminate at a
series of Remote Units (RU's) 600, which in the
preferred optical system embodiment are Optical
Network Units, or ONU's. In the preferred embodiment,
which uses optical fiber as the transmission medium,
each HDT 300 can interface to up to 128 RU's 600.
However, it shall be understood that any greater
or lesser number could be used without departing from
the scope of the present invention. Each RU 600
includes terminations for copper and fiber cables,
electronics for signal conversion between PDN media
(e.g. optical fiber, coaxial cable or other passive
transmission media) and subscriber in-house wiring, and
electronics for multiplexing, digital-to-analog
conversion, signalling and testing. RU's 600 can be
physically located at each individual subscriber premise
or in a curbside unit which is shared between multiple
subscribers (as is shown in Figure 1), in which case
each RU would house the interface to multiple subscriber
telephone lines.

Figure 2 shows a simplified view of a representative
telecommunications system connected in a point-to-multipoint
configuration, including an HDT 300, PDN 500,
and multiple RU's 600. Each PDN 500 is comprised of a
shared feeder 520 which carries multiplexed data
messages between all RU's 600 and HDT 300. Each RU
has an associated distribution path 540, or branch of
the PDN dedicated to deliver and transmit data messages
to and from that RU.

Downstream 522 (from HDT 300 to RU's 600) and
upstream 524 (from RU's 600 to HDT 300) data messages
between HDT 300 and all RU's 600 are multiplexed over
the shared feeder 520 section of the PDN 500. The
present invention can be used with either Time Division
Multiplexing (TDM), Code Division Multiplexing (CDM), or
any other appropriate multiplexing scheme. For the sake
of illustration the preferred embodiment will be
described with respect to TDM. Splitter 530 splits PDN
500 into multiple branches or distribution 540 sections,
each associated with a particular RU 600. The RU's are
programmed to extract downstream messages from and insert
upstream messages into the appropriate TDM slot for that
RU, as is well-known in the art. HDT 300 is programmed
to insert data bound for a particular RU into the
appropriate downstream time slot. Upstream messages
received at HDT 300 from the RU's 600 are identified as
to their source by the location of the messages in the
TDM frame.
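
The slot-based addressing described above can be sketched in Python
as follows. This is an illustrative model only: the frame is a plain
list, and the assumption that each RU's slot index equals its RU
number is made here for simplicity and is not stated in the text.

```python
from typing import Dict, List


def build_downstream_frame(messages: Dict[int, bytes],
                           slot_count: int) -> List[bytes]:
    """Head end: place each RU's message in that RU's time slot."""
    frame = [b""] * slot_count
    for ru_id, payload in messages.items():
        if not 0 <= ru_id < slot_count:
            raise ValueError(f"no time slot for RU {ru_id}")
        frame[ru_id] = payload  # assumed mapping: slot index == RU number
    return frame


def extract_for_ru(frame: List[bytes], ru_id: int) -> bytes:
    """Remote unit: read only the slot assigned to this RU."""
    return frame[ru_id]


def identify_upstream_sources(frame: List[bytes]) -> Dict[int, bytes]:
    """Head end: the slot position alone identifies the sending RU."""
    return {slot: payload for slot, payload in enumerate(frame) if payload}
```

No address field is carried in the payload; the position in the frame
is the only source or destination identifier.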

Performance monitoring in a point-to-multipoint
system such as that shown in Figures 1 and 2 represents
a major challenge. Equipment failures in the PDN
network can occur which cause errors to be introduced in
the downstream and upstream data messages. Each RU in a
point-to-multipoint system does not have a dedicated
communication link to HDT 300. Instead all RU's have a
common path to HDT 300, that is, shared feeder 520. This
makes it difficult in a point-to-multipoint network
configuration to identify the part of the network in
which the equipment failure is located. Specifically,
it is not readily discernible from the error code alone
whether a particular error or group of errors was
introduced by shared feeder 520, one of the dedicated
distribution sections 540 of PDN 500 or by one of the
RU's themselves.

The present system provides a nonintrusive means to
passively monitor the performance of a PDN system and to
isolate the location of equipment failures in a PDN
system, with minimal impact on the overall performance
of the network. To do so, the present system recognizes
and makes use of the fact that different equipment
failures and the errors they produce manifest themselves
in unique error distributions across all RU's over time.
For example, a certain type of error distribution
reveals that an equipment failure is located in the
shared feeder section of the PDN, a different type of
error distribution reveals that an equipment failure is
located in one of the dedicated distribution sections of
the PDN, and a still different type of error distribution
reveals that a failure lies within one of the RU's
themselves.

Referring now to Figure 3, the HDT hardware used to
generate and analyze the error distributions will now be
described. To generate the error distribution, HDT 300
includes an error code generator 310, which generates
multiple error codes, one for each RU, which are
inserted into the downstream traffic path via
multiplexor 320. Each RU has a unique error code
associated with it, and extracts and decodes the
appropriate error code from the downstream traffic path.
Each downstream error code corresponds to a data packet
or message sent to an RU in the downstream traffic path
and is generated using conventional error code
generating techniques. It shall be understood that the
present invention is not limited to the use of any
particular type of error code; however, some example
error codes known in the art which can be used with the
present invention include parity, Cyclic Redundancy
Check (CRC), and Single Error Correction Double Error
Detection (SECDED) codes, among others.
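
As a minimal sketch of one of the options named above, the following
Python fragment generates a single even-parity bit per downstream
message, one code for each RU. The function names and the dictionary
representation are illustrative assumptions, not the circuitry of
error code generator 310.

```python
from typing import Dict, Tuple


def even_parity_bit(payload: bytes) -> int:
    """Return 1 if the payload has an odd number of 1 bits, else 0."""
    ones = sum(bin(byte).count("1") for byte in payload)
    return ones % 2


def encode_downstream(messages: Dict[int, bytes]) -> Dict[int, Tuple[bytes, int]]:
    """Head end: attach one parity bit to each RU's message."""
    return {ru: (data, even_parity_bit(data)) for ru, data in messages.items()}


def downstream_ok(payload: bytes, parity: int) -> bool:
    """Remote unit: recompute parity and compare with the received bit."""
    return even_parity_bit(payload) == parity
```

A single parity bit detects any odd number of bit errors in a message
and misses even numbers, so the counts derived from it are an estimate
of the true error rate.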

Figure 4 shows the relevant hardware of an RU 600.
RU 600 receives the downstream traffic and the
respective error code is decoded by decoder 610. Error
code generator 640 receives the upstream data messages
from the subscribers and generates a corresponding
upstream error code using any of the known error code
techniques. Error code generator 640 also receives the
result of the downstream decode and modifies the
generated upstream error code to indicate an error if a
downstream error was received. Mux 630 inserts the
upstream error code into a position in the
upstream traffic path that is unique to that RU.
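
The way a remote unit folds the downstream result into its upstream
error code can be sketched as below. The sketch assumes a one-bit
parity code and assumes that "modifying the code to indicate an
error" means inverting the parity bit; the text does not fix either
choice, and all names are hypothetical.

```python
from typing import Tuple


def parity_bit(payload: bytes) -> int:
    """Even-parity bit over all bits of the payload."""
    return sum(bin(byte).count("1") for byte in payload) % 2


def ru_upstream_code(upstream_payload: bytes,
                     downstream_ok: bool) -> Tuple[int, bool]:
    """Return (upstream error code, internal error flag) for one RU."""
    code = parity_bit(upstream_payload)
    error_flag = not downstream_ok   # remembered for later polling
    if error_flag:
        code ^= 1                    # assumed: force a mismatch at the head end
    return code, error_flag


def hdt_sees_error(upstream_payload: bytes, code: int) -> bool:
    """Head end decode: any parity mismatch is counted as an error."""
    return parity_bit(upstream_payload) != code
```

With this scheme the head end sees a single error indication whether
the fault lay downstream or upstream, which is why the internal flag
is kept at the RU for later disambiguation.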

Referring again to Figure 3, the upstream error
codes from each of the RU's are received at HDT 300 and
the upstream error codes are decoded by decoder 340.
HDT 300 includes processor 330 which receives and
accumulates the upstream decode information from all the
RU's over a specified period of time. Software data
processing techniques shown and described below with
respect to Figure 9 are used to generate an error
distribution from the accumulated error data. Analysis
and interpretation of the resulting error distribution
via the software data processing techniques shown in
Figure 9, as discussed below, reveals the number and
frequency of errors that occurred, and identifies the
relative type, or "shape" of the resulting error
distribution. Depending upon the type of error
distribution identified, the location of the equipment
failure is isolated to either the shared feeder 520
section of PDN 500, one of the distribution 540 sections
of PDN 500, or one of the RU's 600.

To isolate the location of equipment failures, the
present invention recognizes that four commonly
occurring equipment failures in a point-to-multipoint
PDN configuration result in four different types, or
"shapes" of error distributions as seen across all RU's
over time.

The process by which the present system analyzes and
isolates the location of equipment failures in a
point-to-multipoint configured network will now be described
with respect to Figures 5-8 and the flow diagram shown
in Figure 9. Figures 5-8 show representative error
distributions associated with four commonly occurring
equipment failures in a point-to-multipoint PDN network.
It shall be understood that the example error
distributions shown in Figures 5-8 are presented herein
to show the general shape of the error distributions
under certain equipment failure conditions and are not
necessarily to scale.

The control software shown in flow diagram form in
Figure 9 runs in processor 330 (shown in Figure 3). As
shown in the flow diagram of Figure 9, the first
processing step in the present system is to receive user
settable parameters. In the preferred embodiment of the
present invention, certain parameters are settable by
the user to achieve the degree of accuracy
required by a particular network application. For
example, the maximum acceptable BER and an alarm
triggering BER are both settable by the user for
customizing the present system for a particular user's or
network's needs and requirements. Assume for purposes of
discussion and not by way of limitation that the maximum
acceptable Bit Error Rate, or BER, in the examples of
Figures 5-8 is set at 10^-10, and that the alarm
triggering BER is set at 10^-3.

Referring again to Figure 9, after the user settable
parameters are received and programmed, the upstream
error codes are received and decoded. The error data is
accumulated over a predetermined period of time. After
this time period is complete, an error distribution
across all RU's is generated from the accumulated error
data in accordance with conventional data processing
techniques. Next, the resulting error distribution is
analyzed to identify the type or "shape" of the error
distribution. The type of error distribution identified
indicates an isolated portion of the PDN network where a
particular equipment failure is located. Four types of
error distributions which commonly occur in a
point-to-multipoint network, and the associated equipment
failures indicated and isolated thereby, will now be
discussed.
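
The accumulation step and the resulting per-RU distribution can be
sketched in Python as follows. The event representation (one RU
number per failed upstream decode) and the assumption that every RU
sends the same number of bits during the window are simplifications
made for illustration.

```python
from typing import Iterable, List


def accumulate_errors(error_reports: Iterable[int], ru_count: int) -> List[int]:
    """Count failed upstream decodes per RU over the observation window."""
    counts = [0] * ru_count
    for ru_id in error_reports:
        counts[ru_id] += 1
    return counts


def error_distribution(counts: List[int], bits_per_ru: int) -> List[float]:
    """Convert per-RU error counts into a per-RU BER estimate."""
    if bits_per_ru <= 0:
        raise ValueError("bits_per_ru must be positive")
    return [count / bits_per_ru for count in counts]
```

The list returned by `error_distribution` is the "shape" that the
subsequent analysis inspects, indexed by RU number.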

Figure 5 shows a first type of error distribution.
It shows an example of an even distribution of errors
reported across all RU's. Notice that the BER is
greater than 10^-10, the maximum acceptable BER
in this example. This error distribution reveals
that shared feeder 520 section of PDN 500 is the source
of the errors. This is because feeder 520 is common for
all RU's 600 associated with that shared PDN.
Therefore, an equipment failure in shared feeder 520
will cause errors to be randomly distributed across all
RU's. Thus, the resulting distribution, such as that
shown in Figure 5, shows an evenly distributed
unacceptably high level of errors across all RU's.
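
One possible test for this first distribution type is sketched below.
The text does not define how "even" the distribution must be, so the
`max_spread` factor is an assumed, tunable parameter, and the function
name is hypothetical.

```python
from typing import Sequence


def is_feeder_failure(bers: Sequence[float], limit: float,
                      max_spread: float = 10.0) -> bool:
    """True when every RU exceeds the limit and no RU stands out.

    max_spread is an assumed bound on the ratio between the highest
    and lowest per-RU BER for the distribution to count as even.
    """
    if not bers:
        return False
    if any(ber <= limit for ber in bers):
        return False
    return max(bers) <= max_spread * min(bers)
```

A distribution such as `[2e-9, 3e-9, 2.5e-9]` with a limit of 1e-10
would be reported as a feeder failure, while one with a pronounced
peak would not.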

The error handling procedure which occurs upon
identifying this first type of error distribution and
isolating the equipment failure shown in Figure 5 is
shown in Figure 9. Preferably shared feeder 520 is
equipment protected with a standby unit, and the
recognition of the equipment failure as determined by
the present performance monitoring and failure isolation
system can be used to cause a protection switch to the
standby unit, thus minimizing network downtime. A type
1 alarm is raised by processor 330 to alert maintenance
personnel that a type 1 equipment failure was detected
and that a protection switch occurred so that the failed
equipment can be appropriately repaired or replaced.

Figure 6 shows a second type of error distribution
having an acceptable BER (e.g., less than 10^-10 in this
example) for all RU's except one (RU number 56 in this
example), which has an unacceptably high BER of almost
10^-3. This error distribution reveals that some equipment
associated with that RU, i.e., either the RU itself or
the associated dedicated distribution path 540, is the
location of the equipment failure. Because each
distribution path 540 is dedicated to a single RU,
equipment failures in a distribution section of PDN 500
cause errors to occur only in data messages traveling to
or from that particular RU. Thus, the resulting error
distribution such as that shown in Figure 6 shows an
acceptable rate of errors for all RU's with an error
peak at the particular RU with which the failure is
associated. The error handling procedure for a type 2
error distribution is shown in Figure 9. An alarm is
raised by processor 330 to notify maintenance personnel
of the type 2 error found and to identify the suspect
RU. In addition, appropriate interactive maintenance
routines known in the art are run by maintenance
personnel to determine whether the equipment failure is
located in the RU itself or in the dedicated
distribution path.
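
A minimal Python sketch of recognizing this second distribution type
follows: exactly one RU above the acceptable limit, all others within
it. The function name is hypothetical, and the example values mirror
the RU 56 illustration above.

```python
from typing import Optional, Sequence


def single_failing_ru(bers: Sequence[float], limit: float) -> Optional[int]:
    """Return the one RU whose BER exceeds the limit, or None.

    None is returned when no RU, or more than one RU, is above the
    limit, since those cases are not a type 2 distribution.
    """
    peaks = [ru for ru, ber in enumerate(bers) if ber > limit]
    return peaks[0] if len(peaks) == 1 else None


# Example: 128 RU's at 1e-11, with RU 56 reporting almost 1e-3.
example = [1e-11] * 128
example[56] = 9e-4
suspect = single_failing_ru(example, 1e-10)
```

The result identifies the suspect RU but cannot by itself distinguish
the RU from its distribution path, which is why interactive routines
are still needed afterwards.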

For certain types of RU equipment failures, a failed
RU erroneously reads from or writes into a TDM time slot
assigned to a different RU. Such an equipment failure
results in a dually peaked error distribution such as
that shown in Figure 7. One of the BER peaks represents
the failed RU and the other BER peak represents the
overwritten RU. This dually peaked error distribution
indicates that an equipment failure is located in one of
the RU's having a BER peak. The two peaks must be of
the same magnitude to ensure that the problem is
appropriately identified. If the problem is that an RU
is reading from or writing into another RU's time slot,
the resulting errors apparent in the two RU's error
distributions will be of the same magnitude. This
magnitude is in the preferred embodiment settable by the
user and is shown as 10^-3 in the example of Figure 7.
When this type 3 error distribution occurs, the error
handling procedure shown in Figure 9 raises an alarm to
alert maintenance personnel that a type 3 error
occurred. Appropriate interactive maintenance routines
known in the art can be run by maintenance personnel
between HDT 300 and the suspect RU's to
determine which of the suspect RU's contains the error
causing equipment failure.
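
The dual-peak check can be sketched as below. The text requires the
two peaks to be "of the same magnitude" without quantifying it; this
sketch assumes that means the two BER values lie within `max_decades`
orders of magnitude of each other, with 0.5 as an arbitrary default.
It also assumes a non-negative limit.

```python
import math
from typing import Optional, Sequence, Tuple


def dual_peak_suspects(bers: Sequence[float], limit: float,
                       max_decades: float = 0.5) -> Optional[Tuple[int, int]]:
    """Return the two suspect RU's of a type 3 distribution, or None."""
    peaks = [ru for ru, ber in enumerate(bers) if ber > limit]
    if len(peaks) != 2:
        return None
    first, second = peaks
    gap = abs(math.log10(bers[first]) - math.log10(bers[second]))
    if gap > max_decades:
        return None  # peaks differ too much to share one cause
    return first, second
```

Either returned RU may be the failed unit; the analysis alone does
not say which one is overwriting the other.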

A fourth type of equipment failure results in an RU
randomly reading from or writing to the TDM time slots
of all the other RU's. Figure 8 shows a typical error
distribution for this fourth type of equipment failure.
Figure 8 reveals an unacceptably high BER across all
RU's, with a BER peak at one particular RU (72 in
this example). If the errors represented by the error
peak reach a certain level, then this error distribution
reveals that the peaking RU itself contains an equipment
failure. In Figure 8, for example, the number of errors
occurring over all RU's represented by the error curve
must be of the same magnitude as the number of errors
represented by the error peak at RU 72. In other words,
the total errors represented by shaded area 810 should
be of the same magnitude as the total number of errors
represented by shaded area 812. Checking the magnitude
of these errors ensures that the correct problem is
identified. If the magnitudes are comparable then the
problem can be properly identified as one RU (72 in the
example of Figure 8) randomly writing to or reading from
the time slots of other RU's. If the magnitudes are not
comparable, then the error distribution shown in Figure
8 may simply be a variation of the distribution shown in
Figure 5. For the fourth type of equipment failure
identified by the error distribution of Figure 8, an
alarm is raised to alert maintenance personnel of the
type of failure which occurred and to identify the
suspect RU so that appropriate interactive maintenance
routines known in the art between the RU and the head end
can be performed, as shown in Figure 9.
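
The area comparison described for Figure 8 can be approximated with
per-RU error counts, as in the sketch below. Treating the peak RU's
total count as one area and the sum over all other RU's as the other
area is an assumption, as is the `max_decades` tolerance; the caller
is assumed to have already established that the BER is unacceptable
across all RU's.

```python
import math
from typing import Optional, Sequence


def time_slot_corruptor(error_counts: Sequence[int],
                        max_decades: float = 0.5) -> Optional[int]:
    """Return the peak RU if its errors match the rest in magnitude."""
    if not error_counts:
        return None
    peak = max(range(len(error_counts)), key=error_counts.__getitem__)
    peak_errors = error_counts[peak]
    other_errors = sum(error_counts) - peak_errors
    if peak_errors <= 0 or other_errors <= 0:
        return None
    gap = abs(math.log10(peak_errors) - math.log10(other_errors))
    return peak if gap <= max_decades else None
```

When `None` is returned for a distribution that is high everywhere,
the shape is more plausibly a variation of the even distribution of
Figure 5.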

The above described performance monitoring and
failure isolation system enjoys several advantages over
existing methods. For example, the present method
raises an immediate alarm condition to report errors.
The present invention also passively and unobtrusively
monitors the signal integrity across the PDN with only
minimal impact on overall network performance. This is
opposed to existing schemes which down the entire PDN
network and initiate interactive polling of all RU's to
determine the location of the problem, resulting in
extreme and undesirable reductions in network
performance.

The information obtained by the performance
monitoring and failure isolation system of the present
invention is very useful for protection switching. For
example, if the error distribution reveals that the
error source is located in shared feeder section 520,
that information can be used to initiate a switch to a
standby shared feeder unit as described above.

In most PDN systems, the downstream and upstream
passive distribution media are switched together as
pairs. Thus for most applications the present invention
need only locate which section, either shared feeder or
distribution, contains the equipment failure and not
whether the error source is in the downstream or
upstream data traffic path.

However, the present invention does provide a means
for determining whether the error source is in the
downstream or upstream data traffic path for those
applications such as maintenance and troubleshooting
where such information is required. Upon receipt and
decode of the downstream error code, each RU sets an
internal error flag, shown in Figure 4, indicating
whether a downstream error was received. Later, if HDT
300 receives an error indication from the RU, HDT 300 can
interactively poll the respective RU according to the
flow diagram shown in Figure 10 to determine whether a
downstream error occurred. The HDT 300 polls the RU to
get the value of the RU internal error flag. If the
flag is set, the error occurred in the downstream traffic
path. If the error flag is not set, no downstream error
occurred, meaning that the error was introduced in the
upstream traffic path. Maintenance procedures
appropriate to the given problem can be performed by
maintenance personnel.
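
The polling decision can be sketched in Python as follows. The
`RemoteUnit` class and its `poll_error_flag` method are hypothetical
stand-ins for the interactive poll between HDT 300 and an RU; the
actual polling message format is not described in the text.

```python
from dataclasses import dataclass


@dataclass
class RemoteUnit:
    """Hypothetical model of an RU holding its internal error flag."""
    ru_id: int
    downstream_error_flag: bool = False

    def poll_error_flag(self) -> bool:
        """Answer a head-end poll with the stored flag value."""
        return self.downstream_error_flag


def locate_error_direction(ru: RemoteUnit) -> str:
    """Head end: call only after an upstream error indication from ru."""
    return "downstream" if ru.poll_error_flag() else "upstream"
```

The poll is issued only for an RU that has already reported an error,
so normal traffic on the other RU's is not disturbed.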

It shall be understood that the present performance 
monitoring and failure isolation system is not dependent 
on any particular type of error detection code used. 
The type of error detection selected will typically 
depend on the network's available bandwidth. For 
example, the typical network has little additional 
bandwidth available for insertion of error detection 
codes. In that case, a parity error detection scheme, 
i.e., a single bit error detection code, is 
advantageous. It is conceivable, however, that some PDN 
networks have more bandwidth available. In such a case, 
a more complex error detection/correction scheme 
requiring more bits, such as Single Error Correction 
Double Error Detection (SECDED), Cyclic Redundancy Check 
(CRC), or other more complex error codes could be used. 
Moreover, the type of error detection mechanism 
implemented does not change the fundamental nature of 
the point-to-multipoint performance monitoring and 
failure isolation system of the present invention. The 
use of simple error schemes such as parity type 
codes and the use of more complex error 
detection/correction codes are both anticipated in and 
within the scope of the present invention. 
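
As an illustration of the simplest case named above, a single-bit parity code can be generated and checked as in the following sketch. Even parity and the function names `parity_bit` and `check_parity` are assumptions made for this example; the description does not fix a parity convention.

```python
def parity_bit(message: bytes) -> int:
    """Return the even-parity bit: 1 if the message holds an odd number of 1 bits."""
    ones = sum(bin(byte).count("1") for byte in message)
    return ones % 2


def check_parity(message: bytes, received_parity: int) -> bool:
    """Return True if the recomputed parity matches, i.e. no error is detected."""
    return parity_bit(message) == received_parity


sent = b"\x5a\x0f"
code = parity_bit(sent)
corrupted = bytes([sent[0] ^ 0x01, sent[1]])  # flip one bit in transit

print(check_parity(sent, code))
print(check_parity(corrupted, code))
```

A single parity bit detects any odd number of bit errors but misses an even number, which is the price paid for its one-bit overhead compared with SECDED or CRC codes.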

The specific type of error distribution analysis or 
data processing methods used on the received error codes 
is also not a limiting factor in the present invention. 
Although the preferred embodiment uses, among other 
methods, an analysis of looking for a predetermined 
threshold of the proportion of total errors to the 
proportion of errors associated with a particular RU, 
many different types of analysis on the error data could 
be performed to arrive at the same result without 
departing from the spirit and scope of the present 
invention. 
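
One plausible reading of the threshold test mentioned above is to compare each RU's share of the accumulated errors against a predetermined fraction. The sketch below takes that reading as an assumption; the function names, the dictionary of per-RU error counts and the 0.5 threshold are hypothetical and are not taken from the preferred embodiment.

```python
from typing import Dict, List


def error_shares(errors_by_ru: Dict[int, int]) -> Dict[int, float]:
    """Return each RU's fraction of the total accumulated error count."""
    total = sum(errors_by_ru.values())
    if total == 0:
        # No errors at all: every share is zero.
        return {ru: 0.0 for ru in errors_by_ru}
    return {ru: count / total for ru, count in errors_by_ru.items()}


def units_over_threshold(errors_by_ru: Dict[int, int],
                         share_threshold: float) -> List[int]:
    """Return the RU identifiers whose error share exceeds the threshold."""
    shares = error_shares(errors_by_ru)
    return sorted(ru for ru, share in shares.items() if share > share_threshold)


# Accumulated error counts keyed by a hypothetical RU identifier.
counts = {1: 2, 2: 3, 3: 95, 4: 0}
print(units_over_threshold(counts, share_threshold=0.5))
```

With these counts only RU 3 exceeds the threshold, which would point toward that RU or its distribution section rather than the shared feeder.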

It shall also be understood that various aspects of 
the performance monitoring and failure isolation system 
of the present invention may be used individually if 
desired to achieve differing levels of functionality in 
a particular system. For example, if it is not 
necessary to differentiate downstream or upstream 
equipment failures, the system could be assembled 
without the RU error flag. Also, generation, decode and 
analysis of the upstream error codes alone could be used 
to achieve reduced levels of performance monitoring and 
failure isolation. 

Although a specific embodiment has been illustrated 
and described for the preferred embodiment of the 
present invention set forth herein, it will be readily 
apparent to those of skill in the art that many 
modifications and alterations to the preferred 
embodiment as described herein are possible without 
deviating from the scope and spirit of the present 
invention. Special conditions employed for the 
implementation of the preferred embodiment are not 
intended to be limiting and are easily adaptable to 
alternate implementations. For example, the control 
structure of the present invention could be implemented 
using microprocessor based architectures and logic 
functions, including the use of external computer 
control, RAM microcode control, PLA or PAL logic 
structures and hardwired or software controlled state 
machines. Furthermore, the present invention is in no 
way limited to a certain analytical method used to 
arrive at and interpret the error distribution across 
the PDN. Thus, it will be understood that many 
modifications will be readily apparent to those of 
ordinary skill in the art, and that this application is 
intended to cover any adaptations or variations thereof. 

Therefore, it is manifestly intended that this invention 
be limited only by the claims and the equivalents 
thereof. 

WHAT IS CLAIMED IS: 

1. A system for performance monitoring a network 
and isolating the location of equipment failures 
therein, the network including a head end and a 
plurality of remote units connected with a distribution 
means for transmitting data messages between the head 
end and the plurality of remote ends, comprising: 

first generating means in the head end for 
generating a plurality of first error codes based on a 
plurality of downstream data messages; 

a plurality of first decoding means, each in a 
different one of the plurality of remote units, each for 
decoding one of the plurality of first error codes and 
determining therefrom whether a downstream error 
occurred; 

a plurality of generating means, each in a different 
one of the plurality of remote units, each for 
generating one of a plurality of error codes 
corresponding to an upstream data message; 

decoding means in the head end for decoding the 
plurality of error codes received from the plurality of 
remote units and producing therefrom a plurality of 
error data; 

processing means in the head end for producing an 
error distribution from the plurality of error data; and 

means for analyzing the error distribution to 
isolate the location of equipment failures. 

2. The system of claim 1 wherein the distribution 
means further includes: 

a shared feeder connected to the head end; and 

a plurality of distribution branches, each connected 
to the shared feeder and to a different one of the 
plurality of remote units. 

3. The system of claim 2 wherein the means for 
analyzing further includes isolating the equipment 
failure to the shared feeder, to one of the distribution 
branches, or to one or more of the plurality of remote 
units. 

4. The system of claim 2 wherein the means for 
analyzing further includes means for identifying when 
the error distribution takes the form of a uniform and 
unacceptably high level of errors across all remote 
units. 

5. The system of claim 4 wherein the means for 
analyzing further includes means for isolating the 
equipment failure to the shared feeder. 

6. The system of claim 2 wherein the means for 
analyzing further includes means for identifying when 
the error distribution takes the form of an unacceptably 
high level of errors for two of the plurality of remote 
units and an acceptable level of errors for the 
remaining plurality of remote units. 

7. The system of claim 6 wherein the means for 
analyzing further includes means for isolating the 
equipment failure to the two remote units having the 
unacceptably high level of errors. 

8. The system of claim 7 wherein the means for 
analyzing further includes means for identifying when 
the magnitudes of the unacceptably high level of errors 
for the two of the plurality of remote units are 
equivalent. 

9. The system of claim 2 wherein the means for 
analyzing further includes means for identifying when 
the error distribution takes the form of an unacceptably 
high level of errors for one of the plurality of remote 
units and an acceptable level of errors for the 
remaining plurality of remote units. 

10. The system of claim 9 wherein the means for 
analyzing further includes means for isolating the 
equipment failure to the remote unit having the 
unacceptably high level of errors or to the distribution 
branch connected thereto. 



11. The system of claim 2 wherein the means for 
analyzing further includes means for identifying when 
the error distribution takes the form of a first 
unacceptably high level of errors for one of the 
plurality of remote units and a second unacceptably high 
level of errors for the remaining plurality of remote 
units, wherein the second unacceptably high level of 
errors is less than the first unacceptably high level of 
errors. 

12. The system of claim 11 wherein the means for 
analyzing further includes means for isolating the 
equipment failure to the remote unit having the first 
unacceptably high level of errors. 

13. The system of claim 12 wherein the means for 
analyzing further includes means for identifying when 
the first unacceptably high level of errors is of the 
same magnitude as the total number of errors occurring 
across all the remaining plurality of remote units. 

14. The system according to claim 4 wherein the 
means for analyzing further includes means for 
determining whether the unacceptably high level of 
errors is reached by comparison to a user settable 
maximum acceptable bit error rate. 

15. The system according to claim 11 wherein the 
means for analyzing further includes means for 
determining whether the first unacceptably high level of 
errors is reached by comparison to a user settable alarm 
triggering bit error rate, and for determining whether 
the second unacceptably high level of errors is reached 
by comparison to a user settable maximum acceptable bit 
error rate. 

16. The system of claim 1 wherein the distribution 
means is comprised of optical fiber. 

17. A system for isolating the location of 
equipment failures in a telecommunications system 
including a head end, a plurality of remote ends and a 
distribution means for transmitting messages between the 
head end and the plurality of remote units, the 
distribution means including a shared feeder section 
connected to the head end and a plurality of 
distribution sections each connected to the shared 
feeder and to a different one of the plurality of remote 
units, the system comprising: 

means in the head end for generating a plurality of 
downstream error codes, each based on a different one of 
a plurality of downstream messages; 

means in each of the remote units for receiving a 
different one of the plurality of downstream messages 
and the respective one of the plurality of downstream 
error codes; 

means in each of the remote units for decoding the 
received downstream error code and determining therefrom 
whether a downstream error occurred; 

error flag means in each of the remote units for 
indicating whether a downstream error occurred; 

means in each of the remote units for generating one 
of a plurality of upstream error codes each based on one 
of a plurality of upstream messages and on the decoded 
downstream error code; 

means in the head end for receiving the plurality of 
upstream messages and the plurality of upstream error 
codes; 

means in the head end for decoding the plurality of 
upstream error codes, determining therefrom whether any 
upstream errors occurred, and producing therefrom 
upstream error data; 

accumulating means for accumulating the upstream 
error data over a selected period of time and producing 
therefrom accumulated error data; 

means for producing an error distribution across the 
plurality of remote ends from the accumulated error 
data; and 

means for analyzing the error distribution and 
identifying therefrom a shape of the error distribution 
indicative of the location of an equipment failure. 

18. The system according to claim 17 further 
including means for setting a maximum acceptable bit 
error rate. 

19. The system according to claim 18 further 
including means for setting an alarm triggering bit 
error rate. 

20. The system according to claim 18 wherein the 
maximum acceptable bit error rate is user settable. 

21. The system according to claim 19 wherein the 
alarm triggering bit error rate is user settable. 

22. The system according to claim 18 wherein the 
means for analyzing further includes means for 
identifying when the error distribution shows a uniform 
level of errors across the plurality of remote units 
that is higher than the maximum acceptable bit error 
rate. 

23. The system according to claim 22 wherein the 
means for analyzing further includes means for isolating 
the equipment failure to the shared feeder section. 

24. The system according to claim 19 wherein the 
means for analyzing further includes means for 
identifying when the error distribution shows a first 
level of errors associated with one of the plurality of 
remote units that is higher than the alarm triggering 
bit error rate and a uniform level of errors across the 
remaining plurality of remote units at a second level 
that is lower than the maximum acceptable bit error 
rate. 

25. The system according to claim 24 wherein the 
means for analyzing further includes means for 
isolating the equipment failure to the one of the 
plurality of remote units having the first level of 
errors or to the distribution section connected thereto. 

26. The system according to claim 19 wherein the 
means for analyzing further includes means for 
identifying when the error distribution shows a first 
level of errors associated with two of the plurality of 
remote units that is higher than the alarm triggering 
bit error rate and a uniform level of errors across the 
remaining plurality of remote units at a second level 
that is lower than the maximum acceptable bit error 
rate. 

27. The system according to claim 26 wherein the 
means for analyzing further includes means for 
isolating the equipment failure to the two of the 
plurality of remote units having the first level of 
errors or to the distribution sections connected 
thereto. 

28. The system according to claim 18 wherein the 
means for analyzing further includes means for 
identifying when the error distribution shows a first 
level of errors associated with one of the plurality of 
remote units that is higher than the maximum acceptable 
bit error rate and a uniform level of errors across the 
remaining plurality of remote units at a second level 
that is higher than the maximum acceptable bit error 
rate and lower than the first level of errors. 

29. The system according to claim 28 wherein the 
means for analyzing further includes means for isolating 
the equipment failure to the one of the plurality of 
remote units having the first level of errors or to the 
distribution section connected thereto. 

30. The system according to claim 17 further 
including means for polling the error flag means to 
determine whether a downstream or an upstream error 
occurred. 

31. A method of analyzing an error distribution to 
isolate the location of equipment failures in a 
communications system, the communications system 
including a distribution network including a shared 
feeder connected to a head end and a plurality of 
distribution sections, each connected to the shared 
feeder and to a different one of a plurality of remote 
units, the method comprising the steps of: 

(a) setting a maximum acceptable bit error rate; 

(b) setting an alarm triggering bit error rate, 
wherein the alarm triggering bit error rate is higher 
than the maximum acceptable bit error rate; 

(c) detecting errors received from the plurality of 
remote units and generating therefrom a set of error 
data; 

(d) generating an error distribution from a set of 
error data received from a plurality of remote units; 

(e) comparing the error distribution with the 
maximum acceptable bit error rate and the alarm 
triggering bit error rate; and 

(f) isolating an equipment failure based on the 
outcome of comparing step (e). 

32. The method according to claim 31 wherein 
isolating step (f) further includes the step of 
isolating a first equipment failure if the result of 
comparison step (e) shows an even distribution of errors 
across all remote units at a level that is higher than 
the maximum acceptable bit error rate. 

33. The method according to claim 32 wherein 
isolating step (f) further includes the step of 
isolating the equipment failure to the shared feeder 
section. 

34. The method according to claim 31 wherein 
isolating step (f) further includes the step of 
isolating a second equipment failure if the result of 
comparison step (e) shows a level of errors that is 
higher than the alarm triggering bit error rate for one 
of the plurality of remote units and an even 
distribution of errors across the remaining plurality of 
remote units at a level that is lower than the maximum 
acceptable bit error rate. 

35. The method according to claim 34 wherein 
isolating step (f) further includes the step of 
isolating the equipment failure to the one of the 
plurality of remote units. 

36. The method according to claim 31 wherein 
isolating step (f) further includes the step of 
isolating a third equipment failure if the result of 
comparison step (e) shows a level of errors that is 
higher than the alarm triggering bit error rate for two 
of the plurality of remote units and an even 
distribution of errors across the remaining plurality of 
remote units at a level that is lower than the maximum 
acceptable bit error rate. 

37. The method according to claim 34 wherein 
isolating step (f) further includes the step of 
isolating the equipment failure to the two of the 
plurality of remote units. 



38. The method according to claim 31 wherein 
isolating step (f) further includes the step of 
isolating a fourth equipment failure if the result of 
comparison step (e) shows a first rate of errors for one 
of the plurality of remote units that is higher than the 
maximum acceptable bit error rate and an even 
distribution of errors across the remaining plurality of 
remote units at a second rate of errors that is higher 
than the maximum acceptable bit error rate, wherein the 
second rate of errors is lower than the first rate of 
errors. 

39. The method according to claim 38 wherein 
isolating step (f) further includes the step of 
isolating the equipment failure to the one of the 
plurality of remote units. 



40. A method for monitoring the performance of a 
telecommunications system connected in a point-to-multipoint 
configuration, and for isolating the location 
of equipment failures therein, the telecommunications 
system including a distribution network including a 
shared feeder section connected to a head end and a 
plurality of distribution sections each connected to the 
shared feeder section and to a different one of a 
plurality of remote units, the method comprising the 
steps of: 

(a) generating at the head end a plurality of first 
error codes based on a plurality of downstream 
messages; 

(b) transmitting each of the plurality of error 
codes to a different one of a plurality of remote units 
over the distribution network; 

(c) decoding at each remote unit each of the error 
codes and downstream messages to determine whether any 
downstream errors occurred; 

(d) generating at each remote unit a plurality of 
second error codes based on the decoded downstream error 
codes and on a plurality of upstream messages, wherein 
each of the plurality of second error codes is generated 
in a different one of the plurality of remote units; 

(e) transmitting the plurality of second error 
codes and corresponding upstream messages to the head 
end over the distribution network; 

(f) accumulating the plurality of error codes 
received from the plurality of remote units over a 
selected period of time; 

(g) generating an error distribution from the 
accumulated error codes; and 

(h) analyzing the error distribution and isolating 
therefrom the location of equipment failures. 

41. The method according to claim 40 further 
including the step of setting a downstream error flag in 
the remote unit if the result of decoding step (c) 
indicates that a downstream error occurred. 

42. The method according to claim 41 further 
including the step of polling the downstream error flag 
from the head end to determine whether an equipment 
failure is in a downstream traffic path or an upstream 
traffic path. 

43. The method according to claim 40 further 
including the steps of: 

(i) setting a maximum acceptable bit error rate; 

(j) setting an alarm triggering bit error rate, 
wherein the alarm triggering bit error rate is higher 
than the maximum acceptable bit error rate; 

(k) comparing the error distribution with the 
maximum acceptable bit error rate and the alarm 
triggering bit error rate; 

(l) isolating an equipment failure based on the 
outcome of comparing step (k). 

44. The method according to claim 43 wherein said 
isolating step (l) further includes the step of 
isolating a first equipment failure if the result of 
comparison step (k) shows an even distribution of errors 
across all remote units at a rate that is higher than 
the maximum acceptable bit error rate. 

45. The method according to claim 44 wherein 
isolating step (l) further includes the step of 
isolating the equipment failure to the shared feeder 
section. 

46. The method according to claim 43 wherein said 
isolating step (l) further includes the step of 
isolating a second equipment failure if the result of 
comparison step (k) shows a rate of errors that is 
higher than the alarm triggering bit error rate for one 
of the plurality of remote units and an even 
distribution of errors across the remaining plurality of 
remote units at a rate that is lower than the maximum 
acceptable bit error rate. 

47. The method according to claim 46 wherein 
isolating step (l) further includes the step of 
isolating the equipment failure to the one of the 
plurality of remote units. 

48. The method according to claim 43 wherein said 
isolating step (l) further includes the step of 
isolating a third equipment failure if the result of 
comparison step (k) shows a rate of errors that is 
higher than the alarm triggering bit error rate for two 
of the plurality of remote units and an even 
distribution of errors across the remaining plurality of 
remote units at a rate that is lower than the maximum 
acceptable bit error rate. 

49. The method according to claim 48 wherein 
isolating step (l) further includes the step of 
isolating the equipment failure to the two of the 
plurality of remote units. 

50. The method according to claim 43 wherein said 
isolating step (l) further includes the step of 
isolating a fourth equipment failure if the result of 
comparison step (k) shows a first rate of errors that is 
higher than the maximum acceptable bit error rate for 
one of the plurality of remote units and an even 
distribution of errors across the remaining plurality of 
remote units at a second rate of errors that is higher 
than the maximum acceptable bit error rate, wherein the 
second rate of errors is lower than the first rate of 
errors. 

51. The method according to claim 50 wherein 
isolating step (l) further includes the step of 
isolating the equipment failure to the one of the 
plurality of remote units. 

52. A system for monitoring the performance of and 
isolating equipment failures in a telecommunications 
network, the telecommunications network including a head 
end connected to a plurality of remote units by a 
passive optical network, the passive optical network 
comprised of a first optical fiber connected to the head 
end and a plurality of second optical fibers, each 
connected to the first optical fiber and to a different 
one of the plurality of remote units, the system 
comprising: 

a first error code generator in the head end; 

a plurality of first decoders, each in a different 
one of the plurality of remote units, each connected to 
receive and decode one of a plurality of first error 
codes; 

a plurality of second error code generators, each in 
a different one of the plurality of remote units; 

a second decoder in the head end connected to 
receive and decode a plurality of second error codes; 

processing means in the head end, connected to 
receive the plurality of second error codes, for 
producing an error distribution from the plurality of 
second error codes; the processing means further for 
identifying a particular type of error distribution and 
isolating therefrom the location of equipment failures 
in the telecommunications network. 

53. A method of analyzing an error distribution to 
isolate the location of equipment failures in a 
point-to-multipoint telecommunications network, comprising the 
steps of: 

(a) identifying a type of error distribution; 

(b) isolating from the type of error 
distribution identified in step (a) the location of 
equipment failures in the telecommunications network. 